66 research outputs found

    SMS spam filtering using probabilistic topic modelling and Stacked Denoising Autoencoder.

    Get PDF
    In This paper we present a novel approach to spam filtering and demonstrate its applicability with respect to SMS messages. Our approach requires minimum features engineering and a small set of labelled data samples. Features are extracted using topic modelling based on latent Dirichlet allocation, and then a comprehensive data model is created using a Stacked Denoising Autoencoder (SDA). Topic modelling summarises the data providing ease of use and high interpretability by visualising the topics using word clouds. Given that the SMS messages can be regarded as either spam (unwanted) or ham (wanted), the SDA is able to model the messages and accurately discriminate between the two classes without the need for a pre-labelled training set. The results are compared against the state-of-the-art spam detection algorithms with our proposed approach achieving over 97 % accuracy which compares favourably to the best reported algorithms presented in the literature

    Using Model Explanations to Guide Deep Learning Models Towards Consistent Explanations for EHR Data

    Get PDF
    It has been shown that identical Deep Learning (DL) architectures will produce distinct explanations when trained with different hyperparameters that are orthogonal to the task (e.g. random seed, training set order). In domains such as healthcare and finance, where transparency and explainability is paramount, this can be a significant barrier to DL adoption. In this study we present a further analysis of explanation (in)consistency on 6 tabular datasets/tasks, with a focus on Electronic Health Records data. We propose a novel deep learning ensemble architecture that trains its sub-models to produce consistent explanations, improving explanation consistency by as much as 315% (e.g. from 0.02433 to 0.1011 on MIMIC-IV), and on average by 124% (e.g. from 0.12282 to 0.4450 on the BCW dataset). We evaluate the effectiveness of our proposed technique and discuss the implications our results have for both industrial applications of DL and explainability as well as future methodological work

    SMS Spam Filtering using Probabilistic Topic Modelling and Stacked Denoising Autoencoder

    Get PDF
    In This paper we present a novel approach to spam filtering and demonstrate its applicability with respect to SMS messages. Our approach requires minimum features engineering and a small set of labelled data samples. Features are extracted using topic modelling based on latent Dirichlet allocation, and then a comprehensive data model is created using a Stacked Denoising Autoencoder (SDA). Topic modelling summarises the data providing ease of use and high interpretability by visualising the topics using word clouds. Given that the SMS messages can be regarded as either spam (unwanted) or ham (wanted), the SDA is able to model the messages and accurately discriminate between the two classes without the need for a pre-labelled training set. The results are compared against the state-of-the-art spam detection algorithms with our proposed approach achieving over 97 % accuracy which compares favourably to the best reported algorithms presented in the literature

    Collaborative denoising autoencoder for high glycated haemoglobin prediction.

    Get PDF
    A pioneering study is presented demonstrating that the presence of high glycated haemoglobin (HbA1c) levels in a patient’s blood can be reliably predicted from routinely collected clinical data. This paves the way for performing early detection of Type-2 Diabetes Mellitus (T2DM). This will save healthcare providers a major cost associated with the administration and assessment of clinical tests for HbA1c. A novel collaborative denoising autoencoder framework is used to address this challenge. The framework builds an independent denoising autoencoder model for the high and low HbA1c level, which extracts feature representations in the latent space. A baseline model using just three features: patient age together with triglycerides and glucose level achieves 76% F1-score with an SVM classifier. The collaborative denoising autoencoder uses 78 features and can predict HbA1c level with 81% F1-score

    Improving the multiobjective evolutionary algorithm based on decomposition with new penalty schemes

    Get PDF
    It has been increasingly reported that the multiobjective optimization evolutionary algorithm based on decomposition (MOEA/D) is promising for handling multiobjective optimization problems (MOPs). MOEA/D employs scalarizing functions to convert an MOP into a number of single-objective subproblems. Among them, penalty boundary intersection (PBI) is one of the most popular decomposition approaches and has been widely adopted for dealing with MOPs. However, the original PBI uses a constant penalty value for all subproblems and has difficulties in achieving a good distribution and coverage of the Pareto front for some problems. In this paper, we investigate the influence of the penalty factor on PBI, and suggest two new penalty schemes, i.e., adaptive penalty scheme and subproblem-based penalty scheme (SPS), to enhance the spread of Pareto-optimal solutions. The new penalty schemes are examined on several complex MOPs, showing that PBI with the use of them is able to provide a better approximation of the Pareto front than the original one. The SPS is further integrated into two recently developed MOEA/D variants to help balance the population diversity and convergence. Experimental results show that it can significantly enhance the algorithmΓ’οΏ½οΏ½s performance. © 2016, Springer-Verlag Berlin Heidelberg

    S100A7-Downregulation Inhibits Epidermal Growth Factor-Induced Signaling in Breast Cancer Cells and Blocks Osteoclast Formation

    Get PDF
    S100A7 is a small calcium binding protein, which has been shown to be differentially expressed in psoriatic skin lesions, as well as in squamous cell tumors of the skin, lung and breast. Although its expression has been correlated to HER+ high-grade tumors and to a high risk of progression, the molecular mechanisms of these S100A7-mediated tumorigenic effects are not well known. Here, we showed for the first time that epidermal growth factor (EGF) induces S100A7 expression in both MCF-7 and MDA-MB-468 cell lines. We also observed a decrease in EGF-directed migration in shRNA-downregulated MDA-MB-468 cell lines. Furthermore, our signaling studies revealed that EGF induced simultaneous EGF receptor phosphorylation at Tyr1173 and HER2 phosphorylation at Tyr1248 in S100A7-downregulated cell lines as compared to the vector-transfected controls. In addition, reduced phosphorylation of Src at tyrosine 416 and p-SHP2 at tyrosine 542 was observed in these downregulated cell lines. Further studies revealed that S100A7-downregulated cells had reduced angiogenesis in vivo based on matrigel plug assays. Our results also showed decreased tumor-induced osteoclastic resorption in an intra-tibial bone injection model involving SCID mice. S100A7-downregulated cells had decreased osteoclast number and size as compared to the vector controls, and this decrease was associated with variations in IL-8 expression in in vitro cell cultures. This is a novel report on the role of S100A7 in EGF-induced signaling in breast cancer cells and in osteoclast formation

    Nuclear S100A7 Is Associated with Poor Prognosis in Head and Neck Cancer

    Get PDF
    Tissue proteomic analysis of head and neck squamous cell carcinoma (HNSCC) and normal oral mucosa using iTRAQ (isobaric tag for relative and absolute quantitation) labeling and liquid chromatography-mass spectrometry, led to the identification of a panel of biomarkers including S100A7. In the multi-step process of head and neck tumorigenesis, the presence of dysplastic areas in the epithelium is proposed to be associated with a likely progression to cancer; however there are no established biomarkers to predict their potential of malignant transformation. This study aimed to determine the clinical significance of S100A7 overexpression in HNSCC.Immunohistochemical analysis of S100A7 expression in HNSCC (100 cases), oral lesions (166 cases) and 100 histologically normal tissues was carried out and correlated with clinicopathological parameters and disease prognosis over 7 years for HNSCC patients. Overexpression of S100A7 protein was significant in oral lesions (squamous cell hyperplasia/dysplasia) and sustained in HNSCC in comparison with oral normal mucosa (p(trend)<0.001). Significant increase in nuclear S100A7 was observed in HNSCC as compared to dysplastic lesions (p = 0.005) and associated with well differentiated squamous cell carcinoma (p = 0.031). Notably, nuclear accumulation of S100A7 also emerged as an independent predictor of reduced disease free survival (p = 0.006, Hazard ratio (HR = 7.6), 95% CI = 1.3-5.1) in multivariate analysis underscoring its relevance as a poor prognosticator of HNSCC patients.Our study demonstrated nuclear accumulation of S100A7 may serve as predictor of poor prognosis in HNSCC patients. Further, increased nuclear accumulation of S100A7 in HNSCC as compared to dysplastic lesions warrants a large-scale longitudinal study of patients with dysplasia to evaluate its potential as a determinant of increased risk of transformation of oral premalignant lesions

    Selective laser trabeculoplasty: past, present, and future

    Get PDF
    Over the past two decades, selective laser trabeculoplasty (SLT) has increasingly become an established laser treatment used to lower intraocular pressure in open-angle glaucoma and ocular hypertensive patients. In this review we trace the origins of SLT from previous argon laser trabeculoplasty and review the current role it has in clinical practice. We outline future directions of SLT research and introduce emerging technologies that are further developing this intervention in the treatment paradigm of glaucoma.Eye advance online publication, 5 January 2018; doi:10.1038/eye.2017.273

    Multi-objective particle swarm optimisation: methods and applications

    Get PDF
    Solving real life optimisation problems is a challenging engineering venture. Since the early days of research on optimisation it was realised that many problems do not simply have one optimisation objective. This led to the development of multi-objective optimizers that try to look at the optimisation problem from di erent points of view and reach a set of compromised solutions among the di erent objectives. The presented research brings together recent advances in the eld of multi-objective optimisation and particle swarm optimisation raising several challenges. This is tackled from di erent aspects including the proposal of new archiving techniques to developing new methods and quality measures. Smart Multi-objective Particle Swarm Optimisation based on Decomposition (SDMOPSO) is rst proposed to incorporate multi-objective problem decomposition techniques with PSO. A novel archiving technique is developed using a clustering based mapping approach between the objective and solution spaces and is applied to general multi-objective optimizers. D2MOPSO is introduced as a new MOPSO that uses problem decomposition and a new archive utilising dominance based mapping between objective and solution spaces. Finally the thesis presents a novel multi-objective quality measure that uses mutual information to compare among solutions generated by di erent algorithms. The contributions are all tested on standard test suits and are used to solve two real-life problems: a) Channel selection for Brain-Computer Interfaces, and b) E ective cancer chemotherapy treatments. The two problems are real challenges in the two respective elds. Two di erent modelling approaches of the channel selection problem are presented: one is based on binary representation of the channels, while the other is continuous in a projected space of the channel locations. The results are very competitive with the commonly used methods
    • …
    corecore